Scalable Load Balancing in Networked Systems: A Survey of Recent Advances
Abstract
In this survey we provide an overview of recent advances on scalable load balancing schemes which provide favorable delay performance and yet require minimal implementation overhead. The basic scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. The join-the-shortest-queue (JSQ) policy yields vanishing delays as $N$ grows large, as in a centralized queuing arrangement, but involves a prohibitive communication burden. In contrast, JSQ($d$) schemes that assign an incoming task to a server with the shortest queue among $d$ servers selected uniformly at random require little communication, but lead to constant delays. In order to examine this fundamental trade-off between delay performance and implementation overhead, we discuss a body of recent research on JSQ($d(N)$) schemes where the diversity parameter $d(N)$ depends on $N$ and investigate the growth rate of $d(N)$ required to match the optimal JSQ performance on fluid and diffusion scales. Stochastic coupling techniques and scaling limits play an instrumental role in establishing this asymptotic optimality. We demonstrate how this methodology carries over to infinite-server settings, finite buffers, multiple dispatchers, servers arranged on graph topologies, and token-based load balancing schemes such as join-the-idle-queue (JIQ), thus providing a broad overview of the main trends in the field.
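The two dispatching rules contrasted in the abstract can be illustrated with a minimal Python sketch. It is not taken from the survey: the function names `jsq` and `jsq_d` are hypothetical, queue lengths are assumed to be held in a plain list, the $d$ servers are assumed to be sampled without replacement, and ties are broken arbitrarily.

```python
import random


def jsq(queues):
    """JSQ: return the index of a shortest queue among all N servers.

    This inspects every queue, which is the communication burden
    the abstract refers to. Ties go to the lowest index (an assumption).
    """
    return min(range(len(queues)), key=queues.__getitem__)


def jsq_d(queues, d, rng=random):
    """JSQ(d): sample d servers uniformly at random and return the index
    of a shortest queue among the sampled ones.

    Assumption: servers are sampled without replacement, so 1 <= d <= N.
    """
    sampled = rng.sample(range(len(queues)), d)
    return min(sampled, key=queues.__getitem__)


def dispatch(queues, choose):
    """Forward one arriving task to the server picked by `choose`."""
    i = choose(queues)
    queues[i] += 1
    return i
```

For example, `dispatch(queues, lambda q: jsq_d(q, 2))` forwards a task using two random probes instead of all $N$. Under these assumptions, setting `d` equal to the number of servers makes `jsq_d` select a globally shortest queue, as `jsq` does.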
Similar resources
Load Balancing Approaches for Web Servers: A Survey of Recent Trends
Numerous works have been done on load balancing of web servers in grid environments. The reason behind the popularity of grid environments is that they allow access to distributed resources located at remote locations. For effective utilization, load must be balanced among all resources. The importance of load balancing is discussed by distinguishing between systems without load balancing and with loa...
Scalable Load Balancing in Cluster Storage Systems
Enterprise and cloud data centers are comprised of tens of thousands of servers providing petabytes of storage to a large number of users and applications. At such a scale, these storage systems face two key challenges: (a) hot-spots due to the dynamic popularity of stored objects and (b) high reconfiguration costs of data migration due to bandwidth oversubscription in the data center network. ...
Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods
We present an overview of scalable load balancing algorithms which provide favorable delay performance in large-scale systems, and yet only require minimal implementation overhead. Aimed at a broad audience, the paper starts with an introduction to the basic load balancing scenario – referred to as the supermarket model – consisting of a single dispatcher where tasks arrive that must immediatel...
Scalable Load-Distance Balancing
We introduce the problem of load-distance balancing in assigning users of a delay-sensitive networked application to servers. We model the service delay experienced by a user as a sum of a network-incurred delay, which depends on its network distance from the server, and a server-incurred delay, stemming from the load on the server. The problem is to minimize the maximum service delay among all...
Scalable Load Balancing
In this paper, we develop load balancing strategies for scalable high-performance parallel A* algorithms suitable for distributed-memory machines. In parallel A* search, inefficiencies such as processor starvation and search of nonessential spaces (search spaces not explored by the sequential algorithm) grow with the number of processors P used, thus restricting its scalability. To alleviate this...
Journal
Journal title: SIAM Review
Year: 2022
ISSN: ['1095-7200', '0036-1445']
DOI: https://doi.org/10.1137/20m1323746